Neural learning for distributions on categorical data

نویسندگان

  • F. Xabier Albizuri
  • Ana Isabel González
  • Manuel Graña
  • Alicia D'Anjou
چکیده

F.X. Albizuri, A.I. Gonzalez, M. Graña, A. d’Anjou University of the Basque Country Informatika Fakultatea, P.K. 649, 20080 Donostia, Spain E-mail: [email protected]; Fax: + 34 943 219306 Abstract. In this paper we define a Boltzmann machine for modelling probability distributions on categorical data, that is, distributions on a set of variables with a finite discrete range. The distribution model is suggested by the log-linear models and it is a generalization of the binary Boltzmann machine. High-order connections are defined instead of hidden units in order to model general probability distributions on multi-valued units. We deduce the iterative learning rule that minimizes the divergence function, which corresponds to a neural scheme. We show that this learning rule converges to the global minimum of the Kullback-Leibler divergence. An example is provided to illustrate the modelling capability of the Boltzmann machine with discrete (non-binary) units.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extending Learning Vector Quantization for Classifying Data with Categorical Values

Learning vector quantization (LVQ) is a supervised neural network method applicable in non-linear separation problems and widely used for data classification. Existing LVQ algorithms are mostly focused on numerical data. This paper presents a batch type LVQ algorithm used for classifying data with categorical values. The batch learning rules make possible to construct the learning methodology f...

متن کامل

Town trip forecasting based on data mining techniques

In this paper, a data mining approach is proposed for duration prediction of the town trips (travel time) in New York City. In this regard, at first, two novel approaches, including a mathematical and a statistical approach, are proposed for grouping categorical variables with a huge number of levels. The proposed approaches work based on the cost matrix generated by repetitive post-hoc tests f...

متن کامل

Modeling the categorical perception of speech sounds: a step toward biological plausibility.

Our native language has a lifelong effect on how we perceive speech sounds. Behaviorally, this is manifested as categorical perception, but the neural mechanisms underlying this phenomenon are still unknown. Here, we constructed a computational model of categorical perception, following principles consistent with infant speech learning. A self-organizing network was exposed to a statistical dis...

متن کامل

Probabilistic self-organizing maps for qualitative data

We present a self-organizing map model to study qualitative data (also called categorical data). It is based on a probabilistic framework which does not assume any prespecified distribution of the input data. Stochastic approximation theory is used to develop a learning rule that builds an approximation of a discrete distribution on each unit. This way, the internal structure of the input datas...

متن کامل

Outlier Detection Using Extreme Learning Machines Based on Quantum Fuzzy C-Means

One of the most important concerns of a data miner is always to have accurate and error-free data. Data that does not contain human errors and whose records are full and contain correct data. In this paper, a new learning model based on an extreme learning machine neural network is proposed for outlier detection. The function of neural networks depends on various parameters such as the structur...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Neurocomputing

دوره 34  شماره 

صفحات  -

تاریخ انتشار 2000